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We quantify the Fisher information content of the cosmic shear survey two-point function as a 
function of noise and resolution. The two point information of dark matter saturates at the trans- 
linear scale. We investigate the impact of non-linear non-Gaussianity on the information content 
for lensing, which probes the same dark matter. To do so we heavily utilize N-body simulations 
in order to probe accurately the non-linear regime. While we find that even in a perfect survey, 
there is no clear saturation scale, we observe that non-linear growth induced non-Gaussianity could 
lead to a factor of 4 reduction for the common Dark Energy figure of merit. This effect is however 
mitigated by realistic levels of shot noise and we find that for future surveys, the effect is closer to 
a factor of 1.5. To do so, we develop a new scheme to compute the relevant covariant matrix, ft 
leads us to claim an unbiased estimator with an order of magnitude improvement in accuracy with 
only twice more simulations than previously used. Finally, we evaluate the error on the errors using 
bootstrap methods. 

I. INTRODUCTION 

o 

What is the information content of cosmic shear surveys? Although this question has been addressed many times 
given the current interest in cosmic shear surveys, to answer it accurately turns out to be a non-trivial task. It is the 
■ purpose of this work to offer one answer to this question. 

The measure of cosmic shear (e.g. 0, Q for reviews), is considered to be one of the most promising observational 
tools to understand the origin of the accelerating expansion of the universe [H . Commonly attributed to the existence 
of some extra unknown physics loosely labelled Dark Energy (DE) , its exact nature became one of the salient questions 
in contemporary cosmology [4J, |5|, [y, [j], la, |3, [lCJ . Characterising the physical properties of DE is the main scientific 
driver for the development of new and very ambitious surveys. 

Our work was motivated by recent investigations of the information content of the 3D matter power spectrum as 
quantified by the projection of the Fisher information on the amplitude (Til [T2I Il3l. [l4j . Quantifying the information 
through this well defined statistic (insightful even if restrictive), the answer obtained in these papers was somewhat 
surprising. While a Gaussian like behavior was observed on linear (large) scales, the non-linear growth of structures 
entails an information saturation at mildly non-linear scales. A quasi-Gaussian behavior was recovered once fully 
in the non-linear regime but at a substantially lower level. When devising a survey, it means that optimizing the 
survey to gain sensitivity and resolution in the trans-linear regime (where the power spectrum is currently interpreted 
cosmologically), would not entail much pay-off if one were focusing on this statistic only. Despite the fact that the 
effect of the non-linear growth of structures had been widely studied before these works, formulating it this way led to 
this surprising answer, heuristically understood within the context of the halo model [T4j . Neyrinck and Szapudi (To1 | 
later showed that if we project the Fisher information into other parameters, analogous behavior are then observed. 
This validates the insightful value of the amplitude projection. As a consequence, for the sake of simplicity, we will 
loosely call information the projection of the Fisher information onto various subspaces (amplitude or DE statistics). 
And although we will define all our statistics precisely, their label as information is definitely restrictive. We will 
look at them as a way to highlight the departure from the Gaussian behavior usually assumed when forecasting the 
constraining power of those survey. Beyond this departure, to quantify fully the information content in the non- 
Gaussian regime is a task we will not endeavor in this paper. This paper explores the variance of the power spectrum, 
which is a 4-point statistic of the density field, and its variance, which is an 8-point statistic. Potentially, higher order 
estimators, e.g. 3 or 4 points functions, could contain additional information for non-Gaussian fields. The calculation 
of those errors, and errors on errors, is substantially more challenging. Early investigations [Tt3 | indicated that those 
errors grow rapidly, making their use challenging. 

Whereas new cosmic shear surveys are being advocated [1] and designed [TtJ, we want to study in this work how 
this 3D information saturation translates into cosmic shear observables. In particular, one question we would like to 
answer is whether there exists a scale above which the Fisher Information for the two-point functions (projected onto 
the amplitude or the DE Figure of Merit) saturates. Since above a given angular scale [I > 2000), our lack of precise 
modeling of the physics of baryons might require enormous efforts to be addressed (TR[l5,l20j. it would be interesting 
to know whether such a saturation happens and in particular how it compares to this theoretical uncertainty scale. To 
tackle this question will require to compute the cosmic shear error b udg et in the fully non-linear regime. While this 
questions has already been investigated in the literature [!]], [22|, [23|, [24j, we will address it using numerical N-body 
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simulations to probe accurately the full non-linear regime (still neglecting baryons though), and a new way to build 
the covariance matrix from those quantities. This will lead us to an order improvement in accuracy as compared to 
previous numerical works in the literature. We will quantify this statement by measuring the errors on the errors 
using bootstrap techniques. 

In this paper, we first begin by introducing the methodology of our work in Sec. [II] before revisiting the 3D matter 
power spectrum results in Sec. IIIII as an introduction to the cosmic shear case developed in Sec. IIVI We discuss in 
Sec. [V] the practical consequences of these results for current and coming optical surveys, as well as for CMB lensing. 



II. METHODOLOGY 



If we want to infer a set of parameters at from observables x of dimension n x following a multi-variate Gaussian 
distribution with a covariance matrix C — (xx 1 ) — (x)(x), the Fisher Information matrix is defined as [25L [2II \2l\ 
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Its relevance for parameter estimation can be seen from the Cramer-Rao inequality stating that the Fisher matrix 
sets a lower bound on how well a parameter ai can be measured, that is o 2 {a i ) > 1/F^. We assume from now on 
that the covariance matrix C does not depend on at (for a discussion in the context of cosmic shear, see [28]) and, 
following Rimes and Hamilton (llj . we define the information content of x as 

In this paper, the observables we will consider will be either the 3D matter power spectra, x = -P(k), or the 2D 
convergence power spectra x = C% defined in Eq. 1101 The parameters we will focus will be the standard cosmological 
parameters for a flat cosmological model whose density is dominated at late time by Dark Energy (DE) whose 
equation of state evolves as w = wq + w a (l — a), a — (wq, w a , u> m , u>b, n s , erg) [3]. The nominal value for those 
parameters correspond to the currently favored model, a — (1., 0., 0.1334, 0.0228, 0.963, 0.796) 0. 
For pedagogical reasons, we will also consider a dimensionless version of Eq. [2] 

^-EC?, C-^y. (3, 

This form would be obtained from Eq. [2] if we were measuring the amplitude of a template P(k) (or Cf), so that 
the partial derivatives were unity. Note that because of non-linear effects, this amplitude does not correspond to as 
or the curvature perturbation amplitude As- To put it otherwise, what we will define as information in Sec. IIIII and 
IIVI corresponds to the variance on the amplitude parameter, a, if the observables were to be modeled as x — ax. 
We find this projection of the Fisher Information matrix onto this space to be a convenient quantity to visualize the 
property of this matrix. Given this definition, since we are interested in quantifying the effects on non-linearities on 
the information content of the measured convergence angular power spectrum (or matter power spectrum), it will 
be particularly insightful to investigate the scaling of Inf with a cut-off scale £ ma x (k m ax), that is the cumulative 
information content as function of the smallest (angular) modes measured. The comparison between the scaling on 
large scale (low I and k) where the convergence (matter) field is expected to be Gaussian to the one in the non-linear 
regime (high I and k) will thus be of particular relevance. For this purpose, we define in the matter power spectrum 
and angular power spectrum case (at wavenumber fc& or multipole I), 

Ck b k b ,<k ma!C = 7p~y7]5t~y ' Infikmax) = ^k b \ b , > ( 4 ) 
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The definition of Inf in Eq. [3] is particularly easy to interpret, since in the Gaussian case, where ocx^jS^, it 
directly reduces to half the number of measured modes. We thus have a simple analytical predictions for the expected 
scaling on large scales. Note that we choose to define Inf(£ max ) by imposing a sharp cut-off in Fourier space. An 
alternative definition consists in marginalizing over all the modes above k max (£ m ax) by adding a white noise level so 
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that for example the signal to noise ratio equals 1 at k = k max (£ —(-max)- We checked that both approaches give 
equivalent results. 

As visible from Eq. [3j the key quantities to evaluate the information content of our observables is the covariance 
matrix. For this purpose, we will use a Monte-Carlo approach and generate n s i m realizations of the observables, Xk, 
through N-body simulations including dark matter only. We make use of the publicly available CubePM code [49| . 
CubePM is the successor to the particle-mesh N-body code PMFAST [29] 50] . In addition to the features provided by 
PMFAST - support for distributed memory systems through MPI and shared memory via OpenMP, minimal memory 
overhead and communications requirements - CubePM contains support for gas evolution through use of a TVD MHD 
module, particle-particle interactions at sub grid cell distances, optimal scaling up to (and hopefully beyond) 1000's 
of nodes, as well as shared-memory parallclization via OpenMP to optimize memory usage on shared memory nodes. 

Given a set of n sim realization for x^ that we write, xt~ '"' n "'™ ! we define as an estimator for C (see [131 ] for a 
thorough discussion on how to measure C from one simulation only) 
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At this point, it is often missed that the inverse of a maximum- likelihood estimator for a variable X is in general 
not an unbiased estimator of the inverse X~ l [301 ] . To remedy this fact, a corrective factor is required. Since we also 
evaluate \i from our simulation, it can be shown that the following estimator for C _1 is unbiased: 

C" 1 = n ""-"*- 2 c-i, (8) 

In the case we are interested in here, i.e. the the convergence angular (cross-) power spectrum covariance matrix 
(Cg* 1Z2 Cg* 3Z4 ), we found that the number of independent modes, n x , is not easy to define. Thus we dropped this 
corrective factor. We carefully checked the convergence of our results by increasing n s i m (see Fig. Q] and Fig.[3|). 

Furthermore, to quantify the error on our statements, we will evaluate the errors on C _1 by making use of the 
bootstrap method [311 ■ We will consider 1000 sets of n S i m simulations randomly drawn from our n s i m simulations 
and apply the above defined formalism to each. This procedure weighs in a random manner our initial set of N-body 
simulations. Even though it is unclear wether the number of independant realizations, i.e. simulations, we have is 
enough for the bootstrap method to be reliable it still gives us a valuable glimpse at the reliability of our statements, 
that is on the error on the error. 



III. MATTER POWER SPECTRUM INFORMATION CONTENT 

We first focus on the matter power spectrum and revisit the results of Rimes and Hamilton [l^]. For this purpose, 
we ran 400 N-body simulations with their choice of cosmological model, that is a flat ACDM model with Q, m = 0.29, 
Qa=0.71, Of, = 0.046, og = 0.97 and /i=0.71. Since we are interested in trans-linear scales, i.e. the transition regime 
between the fully linear regime and the fully non-linear regime, a comoving box size of 256 Mpc/ft, with 256 3 grid 
points is appropriate. This gives us roughly a mass resolution of 9.2x 10 11 M Q and a force resolution of 1 Mpc//i. The 
initial conditions were generated at z = 200. 

To improve convergence, to increase the rank of C and to alleviate numerical issues when performing the inversion, 
we define n x = 20 bins logarithmically spaced in k space and we measure the average power spectrum within a k bin, 
x = P as 

^{S^) keb = k 3 b P b . (9) 

Vbox 

where Sj, is the Fourier transform of the matter over-density. We then make use of Eq. 0] and [5] with n sim = 50, 100, 
200, 300 or 400 and n x = 20 to compute the cumulative information content for various k max . The results at z = 1 
are plotted in Fig. [T] as well as the Gaussian prediction using the measured number of modes in the simulation, 

_ Q 

Inf (k max ) = J2b<k £ ^) {X kmax- Qualitatively similar results are obtained at different z. 

First, we notice that the convergence in terms of the number of simulation used to compute Inf seems satisfying 
although not perfect. We estimate that the lack of convergence adds extra-uncertainties of the same order as the 
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FIG. 1: Cumulative information for the matter power spectrum at z = 1 as defined in Eq. [4] We use either n s ; m = 50, 100, 200, 300 
or 400 simulations. The bootstrap error bars are obtained using an analysis of 1000 sets of 400 simulations. The dashed orange line 
corresponds to the Gaussian prediction, i.e. the number of k modes present in the simulation below k max . Although the convergence 
does not appear to be perfect, the difference between the measurement using 300 or 400 appear smaller than the error bars everywhere. 
As such we can trust the results obtained with 400 simulations. This plot reproduces the results of Rimes and Hamilton |l2fl . 



error we estimated using bootstraps. As discovered by Rimes and Hamilton [12j |. the two remarkable features of 
this cumulative information are the following. Whereas on linear scales, say k < 0.1 Mpc//i the information content 
follows the scaling expected from a Gaussian random field, a sharp transition to a plateau is observed on trans-linear 
scales, and a return to a quasi-Gaussian scaling (but with a lower amplitude) in the fully non-linear regime. These 
features mean that the information, say on a primordial amplitude is conserved and could be measured with an 
accuracy directly proportional to (# modes) -1 / 2 on linear scales, not much more is learned on trans-linear scales, i.e. 
the information is redundant with the one contained in linear scales. On the other hand, a quasi-Gaussian scaling 
is reestablished in the fully non-linear regime. The sharp transition from the linear to the non-linear regime can 
qualitatively be understood in the halo model framework, where it corresponds to the transition from the 2 halos 
term to the 1 halo term Whereas on large scales, the information is contained in the 2 halos term and scales as 
the number of modes measured (or the number of halos in a given volume), on trans-linear scales, the 1 halo term 
starts to dominates but with a large variance since most of its contribution comes from rare massive halos. This large 
variance explains why it is hard to extract any information supplemental to the one obtained in the linear regime 
from this regime. On smaller scales though, the 1 halo term contribution comes mostly from numerous smaller mass 
halos whose number is much more constant, i.e. fluctuates with much less variance, and the information scales again 
roughly with the number of modes probed. 

Now that we have reproduced and introduced the key results regarding the 3D matter power spectrum we move 
to the original results of this paper, that is how this information saturation effect in the matter power spectrum 
translates into some integral of it, i.e. into the cosmic shear observables. 



IV. COSMIC SHEAR INFORMATION CONTENT 



We now make use of n sim = 300 N-body simulations run with our nominal cosmology to investigate the information 
content of cosmic shear surveys. To quantify the information, we choose as our observable the convergence cross-power 
spectra between two redshift bins Zi and Zj that we defined as P, Q 

f°° H(z) 
h Zi h Zj C z ^ = J dz W z >(z)W z >(z)-^P(£/D(z),z) , (10) 
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where H(z) is the Hubble parameter, D(z) is the angular diameter distance, P(k, z) is the 3-dimensional matter 
power spectrum at redshift z. The lensing kernel is defined as 

W«{z) = |nm^^(l + z) [ dz' n Z] {z>) Dl ^' ] , (11) 

dz n z .(z) . (12) 



o 

where D LS (z, z') is the angular diameter distance between z and z', n z . (z) is the galaxy distribution in redshift bin i 
and n z . is the total number of galaxies in this redshift bin. 

In this section, for the sake of simplicity, we will consider a uniform galaxy distribution in n 2 = 1, ... ,4 redshift 
bins of width Az = 0.5 and defined as 1.0 < z < 1.5, 1.5 < z < 2.0, 2.0 < z < 2.5 and 2.5 < z < 3.0. Each distribution 
is normalized to unity, i.e. n z . = 1/Az and h z . — 1. This choice is motivated by our interest in low redshift diffuse 
lensing of the diffuse radiation originating from the 21cm line emission of galaxies [13, S|- We will consider more 
realistic galaxy distribution functions when discussing specific surveys in Sec. |Vj As discussed in a companion paper 
[HI], we found that simulations with a box size of 200 Mpc/ft., a 1024 3 grid, and 512 3 particles are close to optimal for 
our needs. Each one of these simulations takes about 4.5 hours using 8 nodes (64 cores) on CITA's Sunnyvale cluster. 
We checked that finite resolution effects do not affect the convergence power spectra up to £ ~ 10000 that will define 
the smaller angular scale we consider in this work. This box size corresponds roughly to an area of 56 square degrees. 
As a consequence, when considering the dimensionless cumulative information, the sum is performed for £ > 50. 

To compute the covariance matrix using n S i m = 300 simulations we employ an original method that avoid the 
artifacts present in previous methods. The now standard approach to simulate cosmic shear has been pioneered in 
pH [3S| . It consists in ray-tracing through a light cone build out of a collection of N-body simulations outputs at various 
redshifts. This method has been widely tested and its limitations (angular resolution, periodicity, mass resolutions, 
etc.) quantified [U [H, [H, [H, H3] • It provides great reliability, e.g. to produce k maps in the observational regime 
of interest nowadays. However, it is important to notice that given our box length, from the observer at z = till the 
most distant lens plane at z = 3, 24 boxes are required. As such, in principle, one could build only 12 fully independent 
light cones out of our n s i m — 300, a number far from enough for our Monte-Carlo approach to compute C^ 1 . A 
common fix consists in using the same simulations more than once in a given light cone after random translations and 
rotations of the original box. While it does increase the number of light cone realisations that can be generated with 
a given number of N-body simulations, it introduces spurious correlations - density field from a same simulation at 
different redshift are not independent - which are hard to control safely. In fact, from the Limber approximation we 
know that the combination of shifting, stacking and recycling will lead to the correct power spectra since it is a linear 
function of the density field at each redshift with random phases. But this does not hold anymore when considering 
covariance matrices. To remedy this problem, we follow an original approach described below. 

The basic idea goes as follows. We first compute the covariance matrix of the convergence (cross-) power spectra 
for each output boxes combination by averaging over all the n S i m sims. The final convergence matrix is then an ap- 
propriately weighted sum of the covariance matrices computed at each output redshift. More formally, this procedure 
can be written this way. To compute the convergence power spectra, for each output boxes at a given redshift z Sl , 
we project on a randomly chosen side the over-density field, 5(x, z Sl ), Fourier transform it and measure its 2D power 
spectrum, <52d(k, z Sl ). After converting the comoving wavenumber k to an angular multipole £ — kD{z Sl ), we weight 
the power spectrum by the lensing kernel W Zsl to transform it into an angular convergence power spectra at z Sl and 
bin it in nt = 12 bins: 

£»z a i,z a2 _ ^~pi~p 2 } e±Ae 5 (13) 
where the average is taken over the Ng contributing to this band power, £ — A£ < £' — kD(z Sl ) < £ + A£, where 

k T =J2^d(t = kD(z Sl ),Zi)W z *i(zi) , (14) 

i 

and where W Zsl is the lensing kernel for each slices if the sources are distributed in band z s \ and the sum over i 
denotes a sum over outputted simulation boxes. It follows that to compute the convergence cross power spectra 
covariance matrix, 

Cov(c* sl ' Zs2 ,c z e ; 3 ' Zs4 ) = (c Zsl ' Zs2 c z ; 3 ' Zsi ) - (c Zsl ' Zs2 )(c z ; 3 ' Zs4 ) , (15) 

we need to compute using our n S i m simulations, both 

V^l °f I sim — \ K e K e \ K £> K £' ) I sim 
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FIG. 2: Dimensionless cumulative information for the convergence cross-power spectra defined in Eq. \E\ We here replaced the projected 
density field in each output box by a Gaussian field with the same power spectrum as the one measured in the N-body box. Each color 
corresponds to a different sub-set of redshift source bands of size 1, 2, 3 and 4. The dashed line corresponds to the matching Gaussian 
predictions using the number of modes measured in the simulations. 



(17) 

and 



V^t lsimV~'l> I sim ~ K^i /simX^e' n e> I sim 

= Y. WZsl ( z ^ w '" (*i ) &d (I,**, )6* 2d (£, Zil )) sil 

. il 

£ W z °* (z l2 ) W" (z h ) (5 2d (£', z i2 )~5* d (£', z t2 )) s 



(18) 



(19) 



Note that care has to be taken regarding the complex conjugates. Whereas the expectation value for the cross-power 
spectra are real, the estimator of the cross power spectra are complex. 

Note that if I ^ the cross terms (S 2d (£, z)S 2d (£', z)) sim do not contribute. After some simple arithmetic, the 
calculation simplifies to 

Cou{C' t ' 1 A2 , = Y^W** 1 (z h )W z >* (z h )W z >° (z h )W z °* (z k ) 

ii 

- (hdii, z tl )s 2d (e, z ll )) svm (s 2 d(£' , z n )s 2d (e', z n )) sim 

+ W'«(z i JW z '*(z ia )W*'>(z il )W'«{z ia ){^ 

il,i2,il^i2 

(20) 



(S 2d (£, %i)^2d(^; z i^)b 2d (i\ Zi^)8 2d (f , z it )) 



For the sake of clarity, we omitted the sum over £ and £' modes within a band power. As compared to the common 
approach that consists in building light cones, from which kappa maps and the associated angular (cross-) power 
spectra are build, the advantages of our method are two-folds. First, we ensure that there is no contamination due 
to the recycling of boxes from the same simulation. While it has been tested that this contamination is a small effect 
at the power spectrum level [36| . it does introduce biases when computing the four point functions of interest to us; 
those biases have not been properly quantified yet. In fact, as was said earlier, the Limber approximation guaranties 
the former, it also is generically biased. A upward fluctuation in the initial condition will lead to correlated upward 
fluctuation at all redshifts. In our scheme, cross terms between redshifts are explicitly not present. 
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FIG. 3: Dimensionless cumulative information for the convergence cross-power spectra defined in Eq. \E\ when considering either one 
redshift source bin (1< z <1.5, left plot) or four (1< z <1.5, . . . 2.5< z <3.0, right plot). Each color corresponds to a different number of 
simulations (100, 200 or 300) used to compute the covariance matrix. The weak scatter amongst those different points allows us to assess 
the satisfying level of convergence we obtain with 300 simulations. 



Similarly, while assuring an unbiased Fisher matrix, this scheme could still result in a bias on its error, since the 
output of the same n S i m simulations is used to compute the covariance matrix of the convergence power spectra at 
each simulation output redshift. Second, the rate of convergence with the number of simulation is much faster than 
with the usual light-cone approach. Imagine we need ribox to build a full light cone (ribox = 24 in our case). The 
number of independent light cone is thus n S i m j ribox and the rate of convergence for the covariance matrix goes as 
\J nbox/n S i m . In our case, the convergence rate for each covariance matrix at each output redshift goes as 1/ y/n S i m . 
Furthermore, on small angular scales, because the non-linear evolution makes the (high) k evolution quite independent 
from one redshift to another, we gain an additional factor l/^/nb ox so that the convergence is closer to l/y/n S i m n,i, ox . 
This effect is less important on large angular scales (linear) however, on all scales, the 1 / w 'ribox scaling will still be 
regained because various k modes will appropriated to various t bin so that when the errors will still average down 
as 1/ yjribox when computing the final covariance matrix. We thus claim a convergence improvement close to 1/ ribox 
(= 24 in our case) as compared to the standard method, i.e. one order of magnitude improvement (see Fig. [3] and 
discussion in Sec. |V|) . 

For cross-checking purposes, we compared all our results with the usual light cone method. We observe a satisfying 
agreement between both. We also clearly observe the different convergence speed as n sim grows. 

Once the n x = nen Zbin (n Zbin + l)/2 dimensional covariance matrix has been computed, we apply Eq. [5] to obtain 
an estimate C _1 . As a first check, we consider a set of Gaussian simulations where the 2D projected density field at 
each output redshift is replaced by a realisation of a Gaussian field with an identical spectrum as the one resulting 
from N-body simulations. In that particular case, given the exact number of modes in each £ shell, we can predict 
exactly the scaling of the dimensionless cumulative information and compare it with the measurements. We consider 
different number of source bins and all the associated angular cross-power spectra. The results are displayed in Fig. [2] 
and an excellent agreement is observed between the analytical predictions (dashed line) and the measured quantities 
(symbols). 

We then move to the genuine N-body simulations and look at the dimensionless information for either one redshift 
source bin (left panel of Fig. [3]) or four redshift source bins (right panel of Fig. [3]). In both plots we check the 
convergence of our result by varying the number of used simulations and compare it with the Gaussian predictions 
(red dashed line). We considered respectively 100, 200 and 300 simulations. Obviously the convergence is satisfying 
and much smaller than the effects we are interested in, i.e. the difference between the red dashed line and the symbols. 
As discussed in (34[, when assessing the convergence of a Monte-Carlo estimator of C _1 is it important to look at 
C _1 (or its norm) and not the diagonal of C as in [38|. The relative difference between the 200 and 300 simulations 
computation, probably an upper bound on the convergence, is consistent with our convergence estimate. It is around 
8%. 

This satisfying convergence is crucial and gives us confidence in the results displayed in Fig.[3J We can now interpret 
them. First, it is interesting that the saturation effect present in the 3D power spectrum (see Fig. [TJ) also appears in the 
convergence power spectrum. As expected, whereas N-body results agree with Gaussian predictions on large (linear) 
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FIG. 4: Dimensionless cumulative information for the convergence cross-power spectra defined in Eq. \5\ Each color correspond to a 
different sub-set of source redshift band of size n z = 1, 2, 3 or 4. The dashed color line corresponds to the Gaussian prediction for the same 
number of modes. As in 3D, the effect of non-Gaussianity is clearly visible as a drop in the dimensionless information content when one 
enters the slightly non-linear regime. The scaling becomes Gaussian again in the fully non-linear regime. As expected, since non-linearities 
increase with redshift, the effect is more severe the lower the source redshift is. 



angular scales, a departure from the Gaussian behaviour (red dashed line) appears at sub-degree scales (£ > 300). Not 
surprisingly, for sources between (1 < z < 1.5), this corresponds to k ~ 0.2/i/Mpc at z — 0.5 where the lensing kernel 
peaks, consistent with what is observed for the 3D power spectrum (at z = 1) in Fig. [3] When higher redshift sources 
are included, we are sensitive to higher redshift, we expect the departure from Gaussianity to be milder (non- linear 
evolution decreases with increasing redshift) and at smaller angular scales (higher £). This corresponds to what is 
observed in the right panel of Fig. [31 

This is illustrated furthermore in Fig. [4] where we consider other source redshift distribution. Both the agreement 
on large scale with the Gaussian prediction and the saturation effect shift to higher I with increasing source redshift 
are clearly visible. So is the decreasing of the saturation effect as the source redshift increases and when tomography 
is included. As for the 3D case, the information increases in the fully non-linear regime, at a substantially lower value 
than Gaussian. Note however that the saturation effect is less dramatic than for the 3D case since the projection 
inherent to lensing introduces an extra Gaussianization. 



V. DISCUSSION 



Now that we highlighted above the dimensionless cumulative information for some perfect idealized survey of angular 
area the corresponding to the size of our simulation, we discuss its implications for current and future optical surveys. 
In particular, we quantify the consequences of the information saturation effects discussed above on the cosmological 
information content. We focus on DE that we parametrize by an evolving equation of state w(a) = Wo + (1 — a)w a . 
We use as our main statistic the DE figure of merit (FoM) [3j, that we define as the area of the 95% contour ellipse in 
the wo — w a plane. Following the definition of the Fisher matrix, F, in Eq. [TJ if we consider the following observable 

V a =z i (zi-i)/2+z j ,t = n Zi n Zj Ce , (i>j) (21) 
and a set of six cosmological parameters a M . The Fisher information matrix writes as 

(=2 ab 



With this notation, cr(a M ) = WF 
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FIG. 5: Dark Energy Figure of Merit as a function of scales for 4 various surveys whose parameters are given in Tab.|I] The blue diamonds 
correspond to the predictions using non-Gaussian covariance matrix and a shot noise contribution. The red diamonds correspond to the 
Gaussian approximation to the Fisher matrix with shot noise. The orange diamonds correspond to the non-Gaussian case with the prior 
expected from the Planck satellite. The blue and red dashed curves correspond respectively to the non-Gaussian and Gaussian cases, 
without shot noise. Whereas we can see that the inclusion of non-Gaussian error bars is important for a perfect noiseless experiment (blue 
dashed curve) as compared to the perfect noiseless Gaussian errors (red dashed curve), it is less critical when adding the shot noise coming 
from the dispersion of intrinsic ellipticities (comparison between blue and red diamonds). 



We will still make use of the 300 simulations introduced in Sec. IIVI and customized for the diffuse 21cm lensing 
[33 | to discuss optical cosmic shear. We do so by re-weighting the previous results obtained using the wide uniform 
redshift bins with the relative weights appropriate coming from a realistic optical galaxy density 

n(z) oc z a e~ [ - z l z ^\ (23) 
with a = 2 and /3 = 1.5. (24) 

hi corresponds to the total number of galaxies in the ith redshift bin. We also rescale the signal covariance matrix by 
the survey area considered. Note that in this work, we will ignore the uncertainties in the number density, however 
important they are [39| |. We will consider four surveys whose parameters are given in Tab.|TJ They roughly correspond 
respectively to a current survey like CFHTLS [5l| . the soon on-line DES [52j survey, and wide and deep space survey 
like Euclid [53| or JDEM [54[. We will normalize n(z) so that the galaxy density matches the one given on the second 
line of Tab. [J and we will consider the shot noise coming from the intrinsic ellipticities of objects. Note that our 
predictions here are somewhat inaccurate due to the fact we are re- weighting our 21cm simulations and also our ideal 
survey is somewhat suboptimal since we consider only 4 redshift bins to perform tomography, which has been shown 
to be slightly sub-optimal • However, our treatment is accurate enough to discuss the effects of non-Gaussianities 
that are the focus of our study. 

As a reference point, we will also compute the Fisher matrix with a Gaussian approximation to C _1 . Doing so, 
we follow the formalism laid out for example in [40]. In the Gaussian case, assuming a shot noise level and simple 
Gaussian sample variance, the covariance matrix is defined as 

C ab i = hihjhkhi (CT * ik C, tot jl + C\ ot il C\ ot jk ) , (25) 
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FIG. 6: Bootstrap errors on the ratio between the noiseless FoM curves for the Type IV survey. We are thus plotting the ratio between 
the red and blue dashed curve in the lower right panel of Fig. [5] 



where a — i(i — l)/2 + j, b = k(k — l)/2 + I and where the total power spectrum is 

Cf tij ' = C£ ij '+<%^, (26) 

where 7; nt is the rms shear error per galaxy per component contributed by intrinsic ellipticity and measurement error. 

Fig. [5] compares the Gaussian and non-Gaussian cases for the 4 different surveys whose parameters are defined in 
Tab. HI Going from the Type I survey to the Type IV survey, we increase simultaneously the number of galaxies, the 
depth, the number of source redshift bins and the survey area hence an increase in FoM. As we previously did for the 
dimensionless information content, we will now study the evolution of the FoM as we increase the number of modes, and 
go from the linear regime to the non-linear regime. This is a proxy to quantify the cosmological interpretation in this 
survey as we increase angular sensitivity. The red dashed lines corresponds to the noise free Gaussian approximation 
while the blue dash line corresponds to the non-Gaussian noise free evaluation. As we can see when comparing these 
two curves, the saturation effect discovered earlier in the cumulative information translates naturally in the evolution 
of the FoM with £ max . In the case of a noiseless survey, the difference in FoM at high angular resolution can be as high 
as a factor of 4, even when we consider four tomographic bins. This effect is thus important and in stark contrast 
with the scaling with £ ma x usually assumed in the literature (e.g. However, when introducing realistic levels of 

noise, the effect is somewhat mitigated as is visible when comparing the red diamonds (gaussian approximation, with 
noise) to the blue diamond (non-gaussian covariance matrix, with noise). It is still however non-negligible since the 
ratio between the Gaussian and non-Gaussian cases at higher £ max becomes closer to 1.6. This point constitutes the 
key result from our study. While potentially very damaging to the ideal performances of weak gravitational surveys, 
the effect of non-Gaussianity is tampered by the estimated level of Gaussian noise expected for current and future 
surveys. The inclusion of the Planck prior does not affect those conclusions at high £. 

Note also that despite the fact that our plot hint at the ability to measure the convergence power spectra up to 
£ < 10000, in practice, it will most likely be limited by theoretical uncertainties at £ < 3000, not the least by our 
inability to model the details of the baryon physics [H, [H, [2(| • 

Consistent conclusions were reached in the halo model based analytical approach followed in [22|, H3] . We will discuss 
the latter as being the most recent and refer to its discussion of the earlier results of [12] • Our results seem consistent 
with their evaluation of the signal to noise ratio (somewhat comparable to our dimensionless information). So does the 
effects on parameters when considering realistic surveys although our factor of 1.6 for FoM seems a bit higher. Note 
however that the comparison is not direct since the set of parameters we consider is slightly different. Whereas we 
consider a flat Universe with 6 parameters including an evolving DE equation of state, they consider non-flat models 
and allow the spectral index to run. We did not translate our evaluation of the cross-power spectrum covariance 
matrices to real space statistics and therefore we refer the reader to the discussion of 24] for a comparison with real 
space evaluation of this effect, as e.g. (23|. A more detailed comparison between analytical estimates and numerical 
work would be instructive and has to be performed but we leave it for further work. So do we for the dependence of 
our results on cosmological parameters, mostly erg, n s and f2 m [28|. Note that we also do not investigate the so-called 
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FIG. 7: Dimensionless cumulative information content as a function of maximum multipole for a source plane at z = 1000. A mild 
saturation effect is still visible. Since the CMB lensing is sensitive mostly to the growth of structure at z ~ 2.5, the saturation effect is 
milder and shifted to smaller angular scales (higher Imax). 



"beat-coupling" effect [13|,|24|. As we understand it, it denotes the extra-mode coupling induced by the finiteness of 
the observed volume, i.e. such an effect would be non-existent if the all-sky was considered. Since the exact form of 
this coupling will depend on the exact mask of any given survey, we decided to ignore this effect here. In practice, as 
it is usually performed for example while analysing CMB data, we would start from a mask description in real space 
and propagate the induced mode-coupling in Fourier space throughout all our calculations |42ll with special care to 
the peculiarities of non-Gaussian statistics. With their own prescription, Takada and Jain [24| found that it does not 
affect qualitatively the effect of the non-linear growth of structures. 

The comparison with the numerical work of (43j is also not obvious. First, a different cosmology with a higher 
us is considered, which should enhance the non-linear effect. Second, they consider 200 simulations and a standard 
tilling technique that should give rise to an accuracy of at most at most 85% on the errors using the scaling formula 
of Takahashi et al. (38J. All those reasons make a direct comparison a bit difficult. 

To quantify further the error on the errors issue, we evaluate the errors of the FoM for the Type IV survey 
introduced before using a bootstrap method discussed above. We display the results in Fig. [6] where we plotted the 
relative difference between the Gaussian and non-Gaussian FoM as a function of i max . Due to the strong convergence 
properties of our covariance estimation technique, we quantify the error on the errors to be around 25% on large 
scale and sub-percent on the smaller scales we consider. This fact certainly constitutes an improvement over previous 
results in the literature and is consistent with our estimate of a 1/ \fn S i m nbox convergence rate. If we use the scaling of 
12/ ' ^Jn s i m measured by Takahashi et al. [381 ] . we claim an overal uncertainty due to the limited number of simulations 
to be around 14%. 

Besides the optical observations of weak-gravitational lensing, the cosmic microwave background (CMB) constitutes 
another source plane where lensing can be observed (see [44[ for a review). As both the resolution and the sensitivity of 
detectors improve, it can now be measured and it defines the next frontier for the CMB temperature and polarization 
measurement [45| . Using cross correlation between WMAP data and other tracers of large scale structures to increase 
the signal to noise, a detection of gravitational lensing in the CMB temperature has been achieved with marginal 



TABLE I: Optical surveys considered 



Survey 


I 


II 


III 


IV 


Area (deg. 2 ) 


200 


200 


5 000 


20 000 


rigai (#/arcmin. 2 ) 


30 


30 


50 


100 


(7) 


0.25 


0.25 


0.25 


0.25 


# redshift bins 


1 


3 


3 


4 
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significance, i.e. around 2.4 a [46l |47| . |48| . A direct detection in temperature is expected to be achieved soon with 
high significance thanks to on-going high angular resolution temperature surveys (ACT, SPT, Planck). It is thus 
interesting to evaluate the effects of the non-linear growth of structures on the CMB lensing signal. To do so, we 
compute the cumulative information as a function of maximum angular scale t m ax, when considering one redshift 
source plane at z = 1000. The result is displayed in Fig. [7] Interestingly, the saturation effect is still visible, and not 
surprisingly, since the CMB lensing kernel peaks around z~ 2.5, the effect is smaller and shifted to smaller scales. 
In practice however, given that the CMB reconstruction is most likely going to be limited by secondary anisotropies 
(kinetic SZ in particular and patchy reionization) around a few £ ~ 3000, it is unlikely that this effect of non-linear 
growth will be a critical effect. 

To conclude, non-Gaussian effects are potentially very important for weak-gravitational surveys and might alter 
significantly the forecasts done so far. When considering realistic noise estimate for the coming optical surveys, the 
impact of non-Gaussian error bars is much milder. However, an interesting indirect consequences of our study is the 
sub-Gaussian scaling of information in the fully non-linear regime (this was also observed in the 3D power spectrum 
[I2I]). This fact highlights the great information gain that can be make by studying this regime, both using 2 point 
statistics but also higher order more specific to non-Gaussian effects. Nevertheless, it will require more theoretical 
insights. 
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